Online Performance Management Using Hybrid Reinforcement Learning
نویسندگان
چکیده
We present a new hybrid approach to performance management, combining disparate strengths of Reinforcment Learning (RL) with model-based (e.g. queuing-theoretic) approaches. Our method trains nonlinear function approximators using offline RL on data collected while a model-based policy controls the system. By training offline we avoid potentially poor performance in live online training, while function approximation allows generalization across both states and actions, so that the need for exploratory actions may be greatly reduced. Our results show that, in a prototype resource allocation scenario among multiple web applications, hybrid RL training can achieve significant performance improvements over a variety of initial queuing model-based policies. We also find that, as expected, RL can deal effectively with both transients and switching delays, which lie outside the scope of traditional steady-state queuing theory.
منابع مشابه
Hybrid Cooperative Agents with Online Reinforcement Learning for Traffic Control
This paper presents the application of fuzzy-neuroevolutionary hybrid system with online reinforcement learning for intelligent road traffic management and control. Taking a step away from the conventional traffic control system, the hybrid system presents different methodologies in knowledge acquisition, decisionmaking, learning and goal formulation with the use of a three-layered hierarchical...
متن کاملEnergy Management Strategy for a Hybrid Electric Vehicle Based on Deep Reinforcement Learning
An energy management strategy (EMS) is important for hybrid electric vehicles (HEVs) since it plays a decisive role on the performance of the vehicle. However, the variation of future driving conditions deeply influences the effectiveness of the EMS. Most existing EMS methods simply follow predefined rules that are not adaptive to different driving conditions online. Therefore, it is useful tha...
متن کاملReinforcement Learning Based PID Control of Wind Energy Conversion Systems
In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...
متن کاملHybrid Solving for Quantified Boolean Formulas Based on SVM and Reinforcement Learning ⋆
In this paper, a new SVM classification algorithm is proposed, this algorithm is applied in Quantified Boolean Formulas (QBF) hybrid solving and a new QBF hybrid solver is designed. This solver apply SVM algorithm to construct inductive models and classify the formulae. At the same time, the reinforcement learning technology is applied to realize the dynamic algorithm selection. The relationshi...
متن کاملHybrid coordination of reinforcement learning-based behaviors for AUV control
This paper proposes a Hybrid Coordination method for Behavior-based Control Architectures. The hybrid method takes in advantages of the robustness and modularity in competitive approaches as well as optimized trajectories in cooperative ones. This paper shows the feasibility of this hybrid method with a 3D-navigation application to an Autonomous Underwater Vehicle (AUV). The behaviors were lear...
متن کامل